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ABSTRACT 



Work in three areas comprised this study of computer 
use in thematic indexing for music research: (1) acquisition, 

encoding, and keypunching cf data--themes of which now number about 
50,000 (primarily 16th Century Italian vocal music) and serve as a 
test base for program development; (2) development of computer 
programs to process this data; and (3) development of special 
comouter typography for music printing. It is recommended that work 
in each of these three areas be continued as desirable and feasible 
in musicological research. (Appended are lists of papers and articles 
summarizing work on this project, samples of music typography showing 
incipits in interval sequence order, and a computer print-out showing 
composer, title, page, source, genre, and serial number.) (JMC) 
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SUMMARY 



This project comprised the third year's work of a three-year research 
proposal. (The first year's work was carried out under USOE Project No. 7- 
8276, Grant No. 1-7-078272-3898 and the second year's work under USOE Project 
No. 8-B-089, Grant No. OEG 0-8-080089-4581). The original purpose of this 
research was to develop computer programs, necessary data and procedures for 
using the computer in a special field of music commonly known as thematic 
indexing. During the third year's work this purpose h-s been extended toward 
a broader study of computer techniques for music research Including the devel- 
opment of special computer typography for music printing. 

Work in three areas was carried out simultaneously during the third year's 

work: 



1. Acquisition, encoding and keypunching of data. The data bank of 
themes, now numbering some 50,000, serves as a test base for the programs de- 
veloped under the project and is available for consultation by other 
researchers. 

2. Development of special programs for information retrieval, pattern 
matching and organization of data for publication by direct photo copy of 
computer output. Attempts to develop programs for precise pattern matching 

of melodies (treating the melody as a wave form) have not yet proved successful. 

3. Development of necessary hardware, programs and procedures to permit 
printing of music on the high-speed computer printer. Special type slugs for 
the printer were designed and manufactured in a joint effort with the IBM 
Corporation's Glendale Laboratory in Endicott, New York and are row available 
to the profession. A sample of music printing is appended to this report as 
Exhibit A. 

An appendix lists five articles and five papers summarizing work under this 
project during the past three years. 




- 4 - 



BACKGROUND FOR THIS RESEARCH PROJECT 



This project comprised the third year's work of a three-year research 
proposal. The original purpose of this research was to develop computer pro- 
grams, necessary data and procedures for using the computer in a special field 
of music commonly known as thematic indexing. During the third year's work 
this. purpose has been extended toward a broader study of computer techniques 
for music research including the development of special computer typography for 
music printing, 

A summary of the background work for the three-year project was given in 
the final report to the first year of funded research under project number 7-8276, 

USOE Grant No. OEG 1-7-078272-3898. This summary narrated the traditional "hand" 
methods of developing thematic indices of music, including the preparation of 
the pilot project which had been carried out to test the feasibility of automating 
these operations. The final project was carried out using the 4,000 opening themes 
(known as inclplts) from 1,000 four-voice pieces of a 16th century Italian reper- 
tory known as the f rot tola . This music was encoded in the machine readable Ford- 
Columbla Music Representation along with data on composer, title, sources, etc. 

A computer program was written which extracted the interval sequence from the 
representation. (For example if the melody ascends the interval of a second, a 
+ 2 is printed; if it descends into a second a -2 is printed, etc.) The Interval 
sequence Is calculated through the aeventh interval and these sequences can in 
turn be arranged in numerical order, thus making it possible to group together 
melodies with the same contour. The pilot study showed that heretofore unidenti- 
fied duplications and borrowings in the frottola repertory do Indeed exist and 
proved the feasibility of applying these techniques to much larger repertories. 

During the first year of the funded project a large amount of data was ac- 
quired, encoded and keypunched. Programs were written to extract greater varieties 
of Information and to permit more effective formats in the computer print-outs. 

Other musicologists were Invited to submit incipits to the data bank which had 
grown to about 25,000 themes. During this period a Conference on Thematic Indexing 
was held in Washington, D.C., on November 4, 1967, and was attended by twenty-three 
musicologists and music librarians with Interests in thematic indexing. 

During the second year of the funded project there was (1) continued acquisi- 
tion, encoding and keypunching of data, (2) continued merging of materials from 
other researchers, with the data bank reaching a total of about 40,000 cards of 
information, (3) continued comparison of print-outs and pattern matching with orig- 
inal music to determine effectiveness of the research design and (4) work was 
begun on development of special hardware, programs and procedures to permit printing 
of music on the high-speed computer printer. 

The development of the special musi typography, mentioned in (4) above at- 
tracted wide Interest and as itt iapo 4ncf became increasingly evident it has 
modified the direction of the overall proje t. In addition to the interest in the- 
matic Indexing as such, the project has broadened to a study of computer applications, 
including the development of new hardware, which can be of value to a wide range of 
music research Interests. These interests Include not only musicological work, but 
also potential applications to libraries, secondary music education and certain 
phases of music publishing. 
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RESEARCH PROCEDURES OF THE 1969-1970 PROJECT 



As in the first two years of the project several procedures were carried out 
simultaneously during the 1969-1970 work. These are listed below with indication 
of changes in emphasis and comparison to earlier work. 

1 . Continued acquisition, encoding and keypunching of data . 

The files of data which have been carefully compiled during the past four 
years represent the most extensive collection of machine readable information 
on early music available anywhere. This is not a "closed" file but rather 
a data bank which will be expanded in the future either by the principal 
investigator himself or with the use of student help available through the 
Music Department. Music colleagues from other parts of the country have also 
contributed to this data bank and it is expected that this collaboration will 
continue. An example of a welcome Interest in collaboration is that of 
Mr. Paganelll, the music librarian at the Conservatory of Music in Bologna, 
Italy, who is preparing an extended listing of lnclpits of anonymous 16th 
century Italian material held by hia library. 

During the latter part of this last year’s project the principal investigator 
visited music colleagues in Italy, Hungary and Czechoslovakia, describing 
the work being carried out under this project and inviting participation in 
the data bank. He was encouraged by the response to these visits. In addition 
to the interest of Mr. Paganelll, mentioned above, it appears that material 
will be forthcoming from Professor Filip at the University of Bratislava, 
Czechoslovakia. Every effort will ba made to get correctly encoded material 
from these sources in order to enrich the local data bank. 

2 . Modifications in the search for pattern matchinga in the original music . 

During the pilot project and the first year of the project considerable time 
was spent comparing the pattern matchings and other computer citations with 
the original music in order to test the validity '*( ’ original thesis of 

the project. As the total oiount of encoded material expanded this became a 

■ tedious and time consuming procedure and it was e:.p>. ilence with this routine 
which was one of the motivations for developing a computer printing of the 
music itself. Because the computer printing is proving successful ve have 
postponed further comparisons of pattern matchings with original music except 
in those instances where the matchings seem particularly precise or have cer- 
tain interesting characteristics. 

3. Continued development of necessary hardware programs and procedures to 
permit printing of music under b lah speed computer printer . 

The development of the means of tainting music on the high speed computer 
printer has taken on increasing ii:€rest and importance throughout this 
project. The computer cen*- r iiiUuccd the developt ent and purchase of a 
minimum set of type characters ( it a cost of $2400 J '’•on the IBM Corporation 
Glendale laboratory in nearby Endiioli, Hew York. Inese were delivered in 
Spring 1969 and programing was begun to permit their use. This prcgrsmmlng 
proved to be quite complex since the computer paper moves in only one direc- 
tion and thus the program must first develop a two-dimenaional array in 
storage and thenproperly distribute the printing of the characters as the 
paper moves through. The program has also been devised so that, for example, 
there la more space following a whole note then following a half note, etc. 

This spacing of notation is Important to appearance and legibility. 
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Throughout the year we have experimented with various formats and arrange- 
ments of materials using the special type characters. From the Moore 
Business Forms Company we ordered a supply of 40,000 pages of specially de- 
signed preprinted staff paper. Since the computer ia "pin driven" (by means 
of equally spaced holes on the left and right edges of the paper) it is 
possible to maintain a very accurate registration of type and staff lines 
for thousands of pages. 

Exhibit A, attached, shows a sample page (or a xerography copy thereof) of 
the most recent printing of music by computer. We have type characters per- 
mitting printing of whole notes, upper and lower stemmed half and quarter 
notes, whole and half rests, flags for smaller note values, dotted notes and 
bar lines. It is obvious that this is a minimum set of type but it is adequate 
for a simple melody such as those used in indexing a 16th century music. I* 
will be necessary however to develop more characters in the future to fully 
utilize the process. In June 1970 consulatlons were begun with the IBM Glen- 
dale Laboratory to investigate the possibility of developing a few more 
characters, especially good and flat sharp signs, eighth rest and quarter rest. 

5. A beginning has been made in the use of the computer typewriter terminal 
for the inputting of data to the computer. A typewriter is faster than a 
keypunch and permits on-the-spot corrections and changes more readily. At 
present the data can be typed using a RAX program which simply stores the 
data and then permits transference of the data to cards during the night 
when the computer is available for "batch processing" of materials. When we 
move to a larger computer (an IBM 360/67) in September, 1970, it will be 
possible to enter the data directly into the project's own data bank. 



RESULTS OF THE PROJECT 



1. A very large data bank of early music (primarily sixteenth-century Italian 
vocal music sources) has been developed during the three years of the project 
and is available to other researchers as well as serving as test material for 
computer programs written under this project. About 50,000 themes are stored 
in the data bank along with composer and title information on <. /er 8,000 com- 
positions. Listing by composer, titles and other formats are available to 
researchers. Exhibits "B" and "C: show examples of such listings. In 
Exhibit "B" composers are listed alphabetically with titles listed alphabeti- 
cally under each composer- Exhibit "C" is a double spaced alphabetical listing 
of ail titles, with single spacing when a title is duplicated. This arrange- 
ment permits quick checking of duplicate titles, jony of them examples of two 
settings of the same text by different composer r ther listings which have 
been generated include arrangements by genre and by source. 

2. Effective programs have been developed and are operative to compute interval 
sequences from incipits and to arrange and format data according to Interval 
sequence. The basic programs for arranging by interval sequence were developed 
during the first year of the project and examples were appended to the final re- 
port of the first year's work. Since that time much effort has gone into 
developing more sophisticated programs to make possible arrangements of incipits 
taking into consideration both pitch and rhythm. The basic theoretical idea 

is to treat melody as a wave form developed on the x and y axes of a graph with 
rhytta as the x (horizontal) axis and pitch as the y (vertical) axis. Given 
this mathematical representation of a melody it should be possible to use pat- 
tern matching programs to find identical or similar melodies, even when melodies 
have been transposed or slightly changed. Progress has been made on this phase 
of the project but to date it is not possible to report a completely satisfactory 
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result because of programming complexities in translating the Ford-Columbia 
representation of the music into a graphical representation. Work on this 
important phase of the project will continue in the coming year. 



3. A music typography for the high-speed computer printer has been developed. 
These special characters are now available from IBM and the following listing 
indicates the catalog number for each character. Note that some characters 
require two type slugs. 



Slug 836288 Dot, ups tern, vertical 
portion of breve rest 



Slug 836291 eighth note 
f lags 



Slug 836289 upstem and downstem segments 

Slug 836290 upstem and downstem segments 
eighth flag, upstem 



Slug 836292 Half and whole 
rests 



Slug 836293 Black note heads 



Slug 836294 Open note heads 



It is expected that in the academic year 1970-1971 several new characters will 
be added to the above set. The first to be developed will be quarter rest, 
eighth rest and tha flat signs (on line and on space) . 



CONCLUSIONS AND RECOMMENDATIONS 



1. This project has demonstrated the desirability and feasibility of 
using the computer as a tool in muslcologlcal research, with emphasis on 
its use in thematic indexing. The effort concentrated on three areas: 
a) acquisition of data, b) development of computer programs to process this 
data and c) development of special hardware (equipment) components to make 
possible the printing of music by the high-speed computer printer. It is 
recommended that the following work be continued in each of these three areas: 

a. Acquisition of da ta. Although the present data batik is large enough 
for demonstrating the effectiveness of programs as they are developed it is 
by no means a complete listing of sixteenth-century Italian vocal musical. 

There is a need in the profession for complete thematic Indices of the madrigal, 
the motet and of other repertories. Expansion of the present listing in any 

of these areas will necessitate work not only by the present Investigator but 
by other contributing scholars. It may be desirable in the future to formalize 
this cooperation by setting up an International committee within a professional 
organization such as the International Muslcologlcal Society. 

b. Development of computer programs to process data . As indicated earlier 
in this report there is stil need for more sophisticated programs to order 
melodic incipits in a manner permitting precise citations of motchings and simi- 
larities. The succes ,xul solution of this problem may require consultation 

with persons in mathematics or outside programmers who have had extensive experi- 
ence in problems of pattern matching. 

c. Development of special hardware (equipment) components . At present 
the emphasis in hardware development in this project lies in the area of music 
typography. This should be continued until a large enough set of music symbols 
is available to permit printing of any standard music notation. Because of 
limited space on the print chain of the computer it will eventually be necessary 
to purchase a chain devoted only to music research and not shared with other 
disciplines as at present. 



2. It has been increasingly evident to this researcher that there are 
potential library applications of this research. Although the thematic 
index is a standard research tool, it is expensive to print in the usual for- 
mat. The music typography described above makes it possible to print the 
standard library card (3x5) containing music notation as well as textual in- 
formation. There are many potential applications, including folk song 
repertories, popular music, lieder, to name but a few, which deserve investi- 
gation. 

3. Finally, it is recommended that the computer typewriter terminal be 
investigated as a means of entering data and of performing certain calcula- 
tions in this research. A limited amount of work has been done on the 
Binghamton campus to explore this possibility and with the installation of a new 
computer in Fall 1970 it is hoped to pursue this further. The present project 
depends on card input. It is still not clear that replacing card input with 
typewriter terminal input would result in a clear advantage, but it is a 
possibility that should be explored. Ultimately, the use of terminal input 

and output will make the data bank directly accessible (by phone lines) to 
scholars at distant points. 
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Summary listing of papers, articles and reports 
covering vork in using the computer in music research 
during three years of funded research (from USOE) . 1967-1970 



Articles 

"Musicology and the Computer: The Thematic Index," in Computers in Humanistic 
Research (E. Bowles, editor), Prentice-Hall, 1967, pp. 184-192. 

"Some Criteria for Preparation of Thematic Indices by Computer," in 
Elektronische Datenverarbeltung in der Musikwissenschaft . (H. Hackmann, editor) 
Regensburg, 1967, pp. 57-62. 

"The Thematic Index: A Computer Application to Musicology" in Computers and 
the Humanities . Vol. 2, No. 5, May, 1968, pp. 215-220. 

"A Computer Application in Musicology: The Thematic Index", Information 
Processing 68 (International Federation of Information Processing Societies), 
Amsterdam, 1969, pp. 957-961. 

"The Computer and Music Research: Prospects and Problems," in B ulletin of 
the Council for Research in Music Education . Fall, 1969, pp. 1-9. 



Papers 

"The Frottole Repertory: A Pilot Study in Information Retrieval," national 
meeting of American Musicological Society, New Orleans, December, 1967 

"A Computer Application to Musicology: The Thematic Index," International 
Federation of Information Processing Societies, Edinburgh, Scotland, 

August 15, 1968. 

"A Computer Produced Thematic Catalog," New York State Chapter of the Music 
Library Association, Ithaca, New York, November 2, 1968. 

"Recent Developments in Computer Applications to Musicology," Washington, D.C. 
Chapter of the American Musicological Society, February 18, 1969. 

"Teaching Computer Applications in Musicology," Symposium on Introducing the 
Computer into the Humanities, IBM Department of Education, Poughkeepsie, 

July 1, 1969. 



(Editor) The Computer and Music , 354 pp., Cornell University Press, 1970. 
(Preface by editor) 
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EXHIBIT m B" 



COM POSE U TITLE 

'/ 

1 


PAGE 


SOURCE 


' GENRE SE UAL H 


4 

1 

i 


CH SIAM HIM D 1 I N TORNO 


OA 5 


T0HCH2 


HL0260430 


J a.r.'UKf 


fcnatg K) LUNGAMFNTE 


CIN1504 




HL00 7005B 


1 GCROCE 


S' A LA GELATA M I A TiMIOA 


El N1503 




HL006993 8 


j gcroce 
u 


SARA OCN 1 RG 


FIN1504 


- — - — 


HL00 70 13B 


*J c.CRnr. r 


S IGNOR N EL TUG f*UR0 R 


FIN1504 




HLOO 70038 


GCRnCE 


SIGNOR N()\ MI RIPRjNOER 


EIN1504 




HL00 7004B 


I GCROC E 


SOCCOR IM | SIGNOR 


EIN1504 


— 


HL0070 178 


GCROCE 


T IRS l MOR I R VOLGA 


F l N1503 




HL007000B 


;| GCROCE 

I;! 


TO SERETO GENTILE 

r 


EINI503 




HL 00 6987 8 


j{ GCROCE 


VAGHI G CAR I AUGELL IN I 


E I Nl 503 


— 


~ HL006989 8 “ 


jl GCROCE 
1" 


VALL I PPflFUNDE 


E f Nl 503 




HL00698 3 B 


1 GCROCE 


V ERM T GL I E ROSE 


EINI503 




HL006988 8 


\ F CRT C I A 


CON LEI FUSS* II) OA CHE SI PARTE 


36 


542/ 1 7 ~ 


MADR IGHL040940 8 '* 


I FCRICU 

.if 


DONNA VOSTRA BEL TA S' A DIR LO 


33 


542/17 


MADPJGHLO 409378 


F Cfi T r. u 


NON VED' hdggi »l mio .sole 


29 


542/ 17 


MAOR IGUL040933B 


f crt r. r \ 


PFRCHE LA VITA E RREVE E L ' I NG ~ 


31 


“542/ 17 


MADR IGHL0409 35 0 


FCRTf. I A 


VIVACE F14MMA CUE CON BUBBIA 


30 


542/17 


MAOR IGHL0409 348 


AC ALU A 


BF;AT A NOB I S GAIJOIA 


P. II 


CHWK060 


HYMN HL0404 76B 


ACALUA 

I 


CHRISTE REDEMPTOft OMNILM' 


P.38 


CHWK060 


HYMN ' HL0404868 


j ACALUA 


TIBI CER ISTE SPLENDOR 


P.35 


_ CHWK060 


HYMN HL040485B 


j ACALIJA 


UT OOF ANT LAXIS 


P.22 


CHWK060 


HYMN HL040481 8 


| 4 CALU A 


VFNI CREATOR SPIRITUS 

« 


P. 13 


CHWKO60 


HYMN HL0404778 


] ACrtLUA 
| 


VFXIL.LA REGIS P.ROQEUNT 


P.5 


CHWK06 0 


HYMN HL040 473 8 


1 .J CANOLA 


C IN GAR | SI NO 












P.13 


CHWK043 


LI EDERHL040 332 B 


J DANILA 


MADONNA NO r SAPIMO 8EN G IOCARE 


P.16 


“CHWK043 


LI E0ERHL0403 35 8 


J C AM CL A 


MEDIC I NU I S I AMD 


P.12 


_ CHWK043 


U EDERHL040331B 


joanola 

k , 


NO I TRE MADONNA STAMO 


P.15 

</ 

• . \ 


CHHK043 


LI EDERHL0403348 
1 4 ^ 

H " 

! jJ 
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EXHIBIT "C" 

i 

COMPOSER TITLE PAGE SOURCE GENRE SERIAL 0 



J CLEMNS 


SCAELCSTC BfcNEFICIUM 


P. 1 


CMM4/15 


MOTET HL020080 D 

jj 


G ASGL A 


$ CANT AO AN T SANCTl 


P.5 


A-R VI 


MOTET HL022202B 


GASOL A 


$CANTA TE $ DOM I NO 


P.113 


A-R VI 


t 

LAUC H..022223B 

| 


F GAFURS 


JCASTRA CAELI 


P,ll9 


ARCMETR 


MOTET HL005V79C 


JCLEXNS 


tC EST A GRAND TORT 


P.135 


CMM4/1U 


CHNSN HLO 19940 G 


PVERDLT 


(CHI 6UFFA 


P.31 


54 1/ 16 


MAOR IGHL04 14 31 ? 


ANON 


3CHRISTE, CUNCTORUM 


P.12 


ARCMETR 


MOTET HLO 205 2 


GWERUKE 


$CHR 1ST I $HATER, AVE 


P. 1 


ARCMETR 


MOTET UL020566B 


JARCCLT 


$COM • E SSER FUOTE AMORE 


P. 14 


541/ 16 


MACRI GHL041413B 


GOUFAY 


iCONDITUR ALME SIOERUM 


P.23 


MINOR 


HYMN HLOO 390>8 


CFEST A 


$COSl ESTREMA LA DOGL1A 


P.30 


541/ 16 


HACR IGHL0414298 


J CL EM NS 


FCREAVIT $ DEUS HOMINEM ...... 


P.73 


CMM4/15 


MOTET HL020093 6 


> 

; NGOMtfAT 


*CUR QUISQUAM CORRADAT 


P. 104 


CMM6/5 


MOTET HL030097B 


\i CLEMNS 

1 


$ U • AH Y PAR FA IC T 


P. 100 


CMM4/10 


CHNSN HLO 19933B 


LSENFL 


$DAS iGLAUT ZU SSPEYER 


P.67 


HI NOR 


LI EDERHL0 0 39 11 B 


MJHAN 


$ 0 EH PERCHE NON E IN VOI 


P. 11 


54 1/ 16 


MAOR IGHL04 14 10 B 


: ANJN 


$DEH PERCHE NON E IN VOI 


P.22 


541/ 16 


MACRIGHL 041421 3 


FGAFURS 


$ C ESC ENl) I IN HORTUM 


P.31 


ARCMETR 


MOTET FL0099 49 fi 


GASOLA 


JOEUS CANTICUM NOVUM 


P.109 


A-R VI 


LAUOE FL022222B 


ANON 


3.0EUS, CREATOR OMNIUM 
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